NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Hermes: Algorithm-System Co-design for Efficient Retrieval-Augmented Generation At-Scale

https://doi.org/10.1145/3695053.3731076

Shen, Michael; Umar, Muhammad; Maeng, Kiwan; Suh, G Edward; Gupta, Udit (June 2025, ACM)

Free, publicly-accessible full text available June 20, 2026
Efficient Memory Side-Channel Protection for Embedding Generation in Machine Learning

https://doi.org/10.1109/HPCA61900.2025.00041

Umar, Muhammad; Marathe, Akhilesh Parag; Gupta, Monami Dutta; Ghosh, Shubham Jogprakash; Suh, G Edward; Xiong, Wenjie (March 2025, IEEE)

Free, publicly-accessible full text available March 1, 2026
FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search

Dotzel, Jordan; Wu, Gang; Li, Andrew; Umar, Muhammad; Ni, Yun; Abdelfattah, Mohamed S; Zhang, Zhiru; Cheng, Liqun; Dixon, Martin G; Jouppi, Norman P; et al (September 2024, Openreview)

Full Text Available
Prospects and Challenges of Inertia Emulation Methods for Low Inertia Power Systems

https://doi.org/10.1109/SGRE59715.2024.10428899

Umar, Muhammad F; Nazari, Amirhosein Gohari; Shadmand, Mohammad B (January 2024, IEEE)

Full Text Available
Resilient Operation of Grid-Forming Inverters Under Large-Scale Disturbances in Low Inertia Power System

https://doi.org/10.1109/OJIES.2024.3501078

Umar, Muhammad F; Gohari_Nazari, Amirhosein; Shadmand, Mohammad B; Abu-Rub, Haitham (January 2024, IEEE Open Journal of the Industrial Electronics Society)

Full Text Available
Coherency Enforcement in the Cluster of Heterogeneous Grid-forming and Grid-following Inverters

https://doi.org/10.1109/ECCE53617.2023.10362336

Umar, Muhammad F; Gohari_Nazari, Amirhosein; Shadmand, Mohammad B (October 2023, IEEE)

Full Text Available
Coherency-based Coordination Scheme to Mitigate Adverse Dynamic Interaction of Grid-Forming Inverters

https://doi.org/10.1109/ECCE53617.2023.10362147

Gohari, Amirhosein; Umar, Muhammad Farooq; Shadmand, Mohammad B (October 2023, IEEE)

Full Text Available
Creating a biomedical knowledge base by addressing GPT inaccurate responses and benchmarking context

https://doi.org/10.1101/2024.10.16.618663

Darnell, S Solomon; Overall, Rupert W; Guarracino, Andrea; Colonna, Vicenza; Villani, Flavia; Garrison, Erik; Isaac, Arun; Muli, Priscilla; Muriithi, Frederick Muriuki; Kabui, Alexander; et al (October 2024, bioRxiv)

We created GNQA, a generative pre-trained transformer (GPT) knowledge base driven by a performant retrieval augmented generation (RAG) with a focus on aging, dementia, Alzheimer’s and diabetes. We uploaded a corpus of three thousand peer reviewed publications on these topics into the RAG. To address concerns about inaccurate responses and GPT ‘hallucinations’, we implemented a context provenance tracking mechanism that enables researchers to validate responses against the original material and to get references to the original papers. To assess the effectiveness of contextual information we collected evaluations and feedback from both domain expert users and ‘citizen scientists’ on the relevance of GPT responses. A key innovation of our study is automated evaluation by way of a RAG assessment system (RAGAS). RAGAS combines human expert assessment with AI-driven evaluation to measure the effectiveness of RAG systems. When evaluating the responses to their questions, human respondents give a “thumbs-up” 76% of the time. Meanwhile, RAGAS scores 90% on answer relevance on questions posed by experts. And when GPT-generates questions, RAGAS scores 74% on answer relevance. With RAGAS we created a benchmark that can be used to continuously assess the performance of our knowledge base. Full GNQA functionality is embedded in the freeGeneNetwork.orgweb service, an open-source system containing over 25 years of experimental data on model organisms and human. The code developed for this study is published under a free and open-source software license athttps://git.genenetwork.org/gn-ai/tree/README.md.
more » « less
Full Text Available
MGX: Near-zero Overhead Memory Protection for Data-intensive Accelerators

https://doi.org/10.1145/3470496.3527418

Hua, Weizhe; Umar, Muhammad; Zhang, Zhiru; Suh, G. Edward (June 2022, Proceedings of the 49th Annual International Symposium on Computer Architecture)

Full Text Available
SoftVN: efficient memory protection via software-provided version numbers

https://doi.org/10.1145/3470496.3527378

Umar, Muhammad; Hua, Weizhe; Zhang, Zhiru; Suh, G. Edward (June 2022, International Symposium on Computer Architecture)

Full Text Available

« Prev Next »

Search for: All records